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Abstract 

We study the entropy of the set traced by an n-step random walk on Z rf . We 
show that for d > 3, the entropy is of order n. For d = 2, the entropy is of order 
nj log 2 n. These values are essentially governed by the size of the boundary of the 
trace. 



1 Introduction 

A natural observable of a random walk is its range, the set of positions it visited. In 
this note we study the entropy of this range - roughly, how many bits of information 
are needed in order to describe it. We calculate the entropy of the range of a random 
walk on Z d , d G N, up to constant factors. 

1.1 Main Result 

Let S(0), . . . , S(n) be a simple symmetric nearest-neighbor random walk on Z d , d G N, 
of length n. Define the range of the random walk to be 

R(n) = {S(0),S(l),...,S(n)}, 
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the set of vertices visited by the walk. 

In this note we study the entropy of R{n) as a function of n (for formal definition 
of entropy, see Section 12. ip . We calculate the value of the entropy, H(R(n)), up to 
constants, precisely: 

Theorem 1. For d = 2 there exist constants c 2 , C% > such that for all n6N, 

n n 
c 2 -^— < H(R(n)) < C 2 -^— 
log (n) log [n) 

and for d > 3 there exist constants Cd, Cd > such that for all n £ N, 

c d n < H(R(n)) < C d n. 

The proof of Theorem [T] is organized as follows: we first prove the lower bound which 
is easier and follows directly from estimates on the size of the boundary of the range; 
in two dimensions the boundary of the range of the walk is of order n/log 2 n, and in 
higher dimensions it is linear in n. This is done in Section 12.21 We then show the upper 
bound which requires a certain renormalization argument. An interesting feature of the 
procedure is that at each step of the renormalization process, the number of "active" 
boxes is not determined by examining the previous renormalization step, but rather 
globally. This is done in Section 12.31 

The one dimensional case is not difficult. 

Exercise. In the case d — 1, there exist constants c\, C\ > such that for all n £ N, 

ci log n < H(R(n)) < Ci logn. 

Acknowledgements. We thank Eric Shellef for useful discussions. We also thank 
Elchanan Mossel for his help with the construction in Section 13.11 

2 Entropy of Random Walk 
2.1 Entropy 

Here we provide some background on entropy. Let X be a random variable taking values 
in an arbitrary finite set Q. For x £ f2, let p(x) be the probability that X = x. The 
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entropy of X is defined as H(X) = E[— \ogp(X)] (all logarithms in this note are base 
2). For two random variables X and Y, the conditional entropy of X conditioned on Y 
is defined as H{X\Y) = H(X, Y) - H(Y). 

Proposition 2. The following relations hold: 

(i) . H(X) < log|fi|. 

(ii) . For every function f, H(f(X)\X) = 0. 
(in). H(X) < H(Y) + H(X\Y). 

For more information on entropy and for proofs of these properties see, e.g., [H Chapter 
2]- 



2.2 Lower Bound 

Notation. By P 2 and E 2 we denote the probability measure and expectation of the 
random walk conditioning on 5(0) = z. We denote P = Po and E = Eo. Let z,w e Z d 
and A C Z d . Denote by dist(z,w) the graph distance between z and w in Z d . Denote 
dist(z, A) = inf {dist(2i, a) : a G A}. We write z ~ if dist(z,u;) = 1, and z ~ ^4 if 
dist(2, A) = 1. The inner boundary of A is defined as 

dA = {z e A : z ~ Z d \A} . 

Let p n (A) = F[R{n) = A}. 
Lemma 3. For every A C Z d , 

Pn(^) < (l 

Proof. Let To = and define inductively for j > 1, 

Tj = inf {t > + 1 : S(t) G OA} . 
By the strong Markov property, for any j < \dA\, 

P [S (Tj + 1)?A \ S(0), S(Tj), Tj < oo] > i-. 



1 \ \9A\-1 

2d) 
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The event A C R{n) implies that Tj < n for all j < \dA\. The event R(n) C A implies 
that S(Tj + 1) £ A for all j < \dA\ - 1. Let be the event that SfTj + 1) e A and 
T J+1 < oo. Thus, 

\9A\-1 

P [fl(n) = A] < P [ p| Ej 



3=1 
\BA\-1 

< n p i ^i. 



1 x 

2d/ 



□ 



Lemma [3] shows that in order to lower bound the entropy of the random walk trace it 
is enough to lower bound the expected value of the size of the inner boundary of the 
random walk trace. 

Corollary 4. H(R(n)) > - log (l - 53) ' E [I<^WI - 1]- 

The following lemma gives the lower bound for the entropy of the random walk trace. 
Lemma 5. For any d > 2, there exists a constant c d > such that for all n £ N, 

C2 i^#W d = 2 > 



c d n 



H(R(n)) > 

Proof. By Corollary HI it suffices to show that 

E[\dR{n)\] > 



d>3. 



C2~, — T7 — r d — 2, 

z log 2 (n) ' 



c d n 



d>3, 



for some constants c d > 0. For z £ Z d , define T z = inf {t > : S{t) = z}. By 
Lemma 19.1 of and by the transience of the random walk for d > 3, there exist 
constants c d > such that for any z ~ w £ Z d , 



F Z [T W >n]> 



jSS- d = 2, 

log n 

Cd d > 3. 



Denote the right-hand side of the above equality by fd{n). Using the strong Markov 
property at time T z , for any z ~ w £ Z d , 



P [z £ &R(n)] > P [T z < n , T w > n] > f d (n) P [T 2 < n] 
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This proves the lemma, since 

E[\dR(n)\] > f d (n) ^ F[T Z <n}= f d (n)E[\R(n)\], 



and since 

E[\R(n)\] > 



Co ' i d — 2, . 

4 log n 



c' d n d > 3, 

for some constants c' d > (see, e.g., Theorem 20.1 in [1]). □ 
2.3 Upper Bound 

We now show that the lower bounds on the entropy of the random walk trace given by 
Lemma [5] are correct up to a constant. The transient case is much simpler than the 
two-dimensional case. 

Proposition 6. For d > 3, there exists a constant Cd > such that for all neN, 

H{R{n)) <C d -n. 

Proof. Let Vl = {A C 7L d : p n (A) > 0}. By clause (i) of Proposition [2] it suffices to 



prove that |0] < (2d) n . This follows from the fact that the number of possible n-step 
trajectories in Z d starting at is (20?)"". □ 

2.4 Two Dimensions 

We now turn to the two-dimensional case, which is more elaborate. 
For 2 6 Z 2 , we denote by the L 2 -norm of z. Denote 

T z , r = inf{t>0 : \\S(t)-z\\<r}, 

and denote T r = T^ r . Also denote 

r 2jr = inf {t > : \\S(t) - z\\ > r} , 

and denote r r = T$ r . 
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2.4.1 Probability Estimates 

We begin with some classical probability estimates regarding the random walk on Z 2 , 
which we include for the sake of completeness. 

Lemma 7. There exists a constant C > such that for all neN, 



E 



max IISYA;)!!' 

0<fc<n 1 



< Cn. 



Proof. Let S(k) = (X(k),Y(k)), so \\S(k)\\ 2 = \X(k)\ 2 + \Y(k)\ 2 . Doob's maximal 
inequality (see, e.g., [SJ Chapter II]) on the martingale X(k) tells us that 



E 



max \X(k)\ 2 <4E[\X(n)\ 2 }. 

0<k<n J 



The martingale \X(k)\ 2 — k/2 tells us that E [|X(n)| 2 ] = n/2, which completes the proof, 
since X(k) and Y(k) have the same distribution. □ 

Lemma 8. There exist constants c 1; c 2 > such that for all n eN and X > 0, 



P 



max \\S(j)\\>\ 

l<j<n 



X 2 , 

< ci • exp ( -C2— ) • 



Proof. This is a consequence of Theorem 2.13 in [1]. □ 

Lemma 9. There exists a constant c > such that the following holds. Let T = T^ . 
Then, for zeZ 2 and r > 2 ||z||, 

clog(r/ \\z\\) 



F, [T < r r ] > 



logr 



Proof. Let a : Z 2 — >• [0, oo) be the potential kernel defined in Chapter 1.6 of [2\. That is, 
a(0) = 0, a(-) is harmonic in Z 2 \ {0}, and there exist constants Ci, c 2 > such that for 
any z G Z 2 \ {0}, a(2;) = Ci log ||2;|| + c 2 + 0(||2;|p 2 ). Since a(-) is harmonic in Z 2 \ {0}, 
if r > ||2;|| then a(S(t)) is a martingale up to time T' = min {T,r r }. Thus, 

a(z) = (1 - F z [T < r r }) ■ E z [a(S(T')) \ T > r r ] , 

which implies 

^ i ci log ii^ii + c 2 + o(iizir 2 ) „ 

C\ logr + c 2 + C(r z j 
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We also need an upper bound, 

Lemma 10. There exists a constant C > such that for every z G Z 2 and r, R such 
that 1 < r < \ < 

log(it/r) 

Proof. Using the potential kernel from the proof of Lemma [9] with the stopping time 
min{T r ,T/j}, there exists a constant C\ > such that 

ci(logi2 - log ||*||) + 0(R^ + ll^lp 2 ) 



<C- 



ci(logi? — logr) + 0(r~ 2 ) 
\og(R/\\z\\) 



\og(R/r) 

for some constant C > 0. □ 

Lemma 11. For any < a < 1, there exists a constant C > such that the following 
holds. Let z El? such that \\z\\ > 1/a. Then for any n£N such that n > ||^|| 4 , 

c 



P, [T aM > n] < 



log(n/ ||z|| ) 



Proof. By adjusting the constant, we can assume without loss of generality that n/ \\z\\ A 
is large enough. Let r = a\\z\\ and R = n 1//4 . Using the potential kernel from the proof 
of Lemma [9] with the stopping time T' = min {T t ,tr\, 

P, [Tr > r R] < ^M/l^Q < - — j- (2 ,) 
Cilog(i2/r) +0(r x ) log(n/ ||^|| ) 

for some constant C\ = C\{a) > independent of z and n. Also, considering the 
martingale ^(t)!) 2 — t up to time t r shows that E 2 [tr] < l) 2 . Thus, by Markov's 
inequality, 

P, [th > 7i] < - 7 =. (2.2) 



(12.11) and (12. 2p together prove the proposition, since 

P, [T r > n] < F z [T r > t r ] + F z [t r > n] . □ 
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Lemma 12. There exists a constant C > such that for all n e N and 1 < r < \\fn 
the following holds. Let z e Z 2 be such that > y^n. Then, 

C 



W z [T r <n}< 



log(n/r 2 ) 



Proof. For m > 1, let A m be the event {r m || z || < T r < r( m+ i)|| z || < n}. The family {A m } 
consists of pairwise disjoint events, and 

oo 
m=l 

For every m > 1, using the strong Markov property at time T m || z ||, 

P Z [An] < P*[Tm||x|| < n ] ' max {^x[T r < T( m+ i)|| a ||] : 771 \\z\\ < \\x\\ < Ul \\z\\ + l} . 

By Lemma [HI there exist constants C\, c-i > such that 

Pz[ r m|j.z|j <n] <F Z max ||5'(j)|| > m \\z\\ — \\z\\ < C\ exp (— C2?n 2 ) . 

By Lemma [TUl for any i£Z 2 such that m ||z|| < ||x|| < m \\z\\ + 1, 

F x [T r < r( m+ i)|| a ||] < - — 3 , 
log(n/r 2 ) 

for some constant C3 > 0. Summing over all m > 1, 

00 

P*[T r < n] < - . C3 . V ci exp (-c 2 • m 2 ) . □ 
logm/'H) v ' 

v ' ' m=l 

2.4.2 Upper bound in two dimensions 

For z e7? and k E N, let <2(z, fc) = {z + (j, /) : -k < j,f < k}\ i.e., Q(z, k) is the 
square of side length 2k + 1 centered at 2;. For a path x(0),x(l), . . . ,x(n) in Z 2 , we 
denote by x[s, t] the path x(s), x(s + 1), . . . , x(t). 

Lemma 13. There exist constants c, C > such that for alln, k £ N such that k < n 1//4 ; 
and all z e Z d suc/i f/iai ||z|| > 5-y/n, 

P LR(n) n Q(z, k) ^ 01 < exp f - c^^) . 

logn V n / 
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Proof. Let A = ||z|| — 2y/n. Let T be the first time the walk S(-) started at hits 
Q(z, k). Then r A < T 2 2 fc < T. By Lemmas 151 and [T2| 

P [R(n) D Q(z, k) j£ 0] < P[r A < n] • max {F x [T Zi2k <n] : A < ||x|| < A + 1} 

Cl 



logn 



<P max \\S(j)\\ > A 
ii i- 

C 2 / 2 

< ; exp - c 3 

log n V n 

for some constants C\, c 2 , c 3 > 0. □ 

Lemma 14. There exists a constant C > snc/i £/ia£ £/ie following holds. For all 
n,k<EN such that k < n 1//4 , and all z G Z d suc/i i/iai 1 < ||^|| < Sa/tz, 

p[i?HnQ(z,fc)^0]<c log(1O ^ /NI) 



logn 

Proof. By adjusting the constant, we can assume without loss of generality that ||z|| > 
3k. Let Q = Q(z, k). Define a = 0, and for % > 1, define 

a 4 = r 10 ^ = inf{t >0 : > 10SAT}. 

The event {.R(n) fl Q ^ 0} is contained in the event 

{S[0, a,] n Q ^ 0} U |J {Sfo, <r i+1 ] n Q ^ , <t, < n} . 

i>l 

Since 3fc < ||z|| < By'n, we have that the event {S^O, a±] fl Q 7^ 0} implies that the 
random walk started at hits the ball of radius 2k around z before exiting the ball of 
radius 20y/n around z. Translating by minus z we get by Lemma [TU1 that there exists a 
constant C\ > such that 

P [S[0, a x ] fl Q * 0] < P_ 2 [T 2fc < r 20VE \ < C x • 1°^ 10 ^/ MD 



logn 

Fix z > 1. By Lemma [H 



k- < nl < P 



max ||5C7') || > VVy/n < C 2 ■ exp ( - C 3 • 10 2i ). 
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for some constants C2, C3 > 0. Using Lemma [TU] again, 

¥[S[a h a i+ i] H Q ^ | ct; < n] < 

logn 

for some constant C4 > 0. Therefore, 



F[R(n) n Q ^ 0] < Cl ■ ^f 1 ' M) + ^ Y) exp ( - C 3 ■ 10*) . □ 

log n log n ^— ' 



We have reached the main geometric lemma, 

Lemma 15. There exists a constant C > suc/i that the following holds. Let n, k e N 7 
/et Q = Q(0, A;) and let z ~ Q. Then, 

F z \dR(n) n Q ^ 0] < C • 

logn 

Proof. Without loss of generality assume that log 2 fc < logn. Define Q+ = Q(0, k+1). 
So Q + contains the union of Q with all vertices that are adjacent to Q. Define To = 0, 
and inductively 

<Tj = inf {t > Tj : \\S(t)\\ > 10k}, 

T j+ i = inf {t > Gj : S(t) G Q + } . 

If Q + C R(n) then dR(n)C\Q = 0. Thus, it suffices to upper bound the probability of the 
event {Q + <£_ R(n)}. With hindsight choose m = [log A; • logn]. Set Vj = — Oj > 

2^} and Uj = {Q + <f_ R((Tj)}. We prove the following inclusion of events 

m— 1 

{Q+ £ i?(n)} C {(T > n/2} U C/ m U \J {U s fl ^). (2.3) 

i=o 

Assume that the event on the right-hand side of (12. 3p does not occur; i.e., assume 
that a < n/2, that U m , and that for all < j < m — 1, f/j U V,-. Let J = 
min |0 < j < m : £/, }. Consider the following cases: 

• Case 1: J = 0. Then Q + C R(a ). Since cr < ^/2, we get that Q + C i?(n). 
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• Case 2: J > 0. Since we assumed that U m , we know that 1 < J < m. By the 
assumption (Uj U Vj), we have that <7j + i — aj < n/2m, for all < j < J — 1. 
Since we assumed that a < n/2, we get that 

j-i 

oj = cro + ^2 a i +l ~ °i < n - 

j=0 

But J was chosen so that Uj occurs, so Q + C -R(cxj) C R(n). 



This proves (12.31) . 

Fix j > 0. The martingale \\S(t) — z|| 2 — t shows that E^CTj — Tj | ^(tj)] < C\k 2 for 
some constant C\ > 0. Using Markov's inequality, 



P. 



n 

-Tj> 



< 



n 



(2.4) 



for some constant C*2 > 0. By Lemma [TTj there exists a constant C3 > such that 

C 3 



P- 



Tj+l (\, > — 



n 

4m 



< 



log n 



(2.5) 



The two inequalities, (12. 4p and fl2.5j) . imply that 

p, ft I < 



logn 



(2.6) 



for some constant C4 > 0. Using Lemma [9], there exists a universal constant C5 > 
such that for any x G Q + , 



P, [xG%„a,] j ^(r,)] > 



log 



Thus, 



F z [Uj] = P 2 [Q + Rfa)] < min {l, |Q+| • (1 - C 5 /log^ +1 } 

< min{l,C 6 A; 2 exp(-C , 5 (j + l)/logfc)}, (2.7) 
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for some constant C 6 > 0. Plugging (J23D, fl2~£D and ([277]) into yields 

if 

F z [Q + <£ R(n)] < F z [a > n/2] + P, [U m ] + ^ P, , V)] + ^ F z [U j , V s ] 

j=0 j>K 

^ c fk 2 , n -c 8 , y 1 , y fc 2 exp(-C 5 (j + l)/logfc) 
~~ V n r— ' log n log n 

j=0 b j>K 6 

Co log 2 

< ° S , (2.8) 
log n 

where K = [4 log 2 k/C 5 ~\ and C 7 , C 8 , C 9 > are constants. □ 

Definition 16. Define A(Jfc) = {(2k+l)z : 2 G Z 2 }. The collection {Q(z, &)} z6A(fe) 
consists of disjoint squares that cover Z 2 . For k,n G N and z G Z 2 , define I(z, k,n) to 
be the indicator function of the event {dR(n) fl Q(z, k) ^ 0}. Define 

M(k,n) = J (z,k,n), 

zeA(fc) 

the number of squares that intersect dR(n). 

Lemma 17. There exists a constant C > such that for every k,n eN, 

E[Af(jfc,7i)] < C- max { 1,^-^4^ )• 

I k z log n > 

Proof. Fix fc, n G N. For z G Z 2 , the event {dR(n) fl Q(z, A;) 7^ 0} implies the event 

{max\\S(j)\\>\\z\\-V2(k + l)}. 
We start with an a-priori bound. Using Lemma [7J 

E[M(ra,Jfc)]< V P [||z|| < max \\S(j)\\ + V2(k + 1) 



0<i<n 

zeA(fc) 

< d - max J 1, AT 2 • E 



max H^O') II' 

0<j<n 



< C 2 -max{l,^} 
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for some constants C\,Ci > 0. Thus, we can assume without loss of generality that 
k<k + l<(n- v/^) 1 ^ < n 1 / 4 . 

Let 

r Q 0) = inf {t > : S(t) G Q{z, k + 1)} 

and let 

J(z,k,n) = l^^-^} -I(z,k,n). 

For all z G A(fc), a.s. 

n) < l{ n _^ <T()W < B } + J(z,fc,n). 
Summing over all z G A(fc), a.s. 

M(n, k) < A^/n + ^ J(z,n,k). (2.9) 

zGA(fc) 

By the strong Markov property at time tq(z) and Lemma [T5l there exists a constant 
C 3 > such that a.s. 

P [0i2(n) n Q(*, fc) ^ I r Q (*) < n - < C 3 • (2.10) 

1 logn 

By Lemma [14l there exists a constant C4 > such that for all z G Z d with 1 < ||z|| < 
5-y/rT, 

m r / x m „ log(10v/n/ Ibll) 

L J logn 

which implies 

P[J(zA n )]<C 5 -^ > g(1 °^ /MI) , (2.11) 
log n log n 

for some constant C 5 > 0. 

Denote T = 5y / ri/(2A; + 1). Summing over all z G A (A;) such that 2 < \\z\\ < b\fn, 



MiOvV \\z\\) < M2r/ v / ^+7) 

eA(fe) x,y& 
«||<5Vn 2<x 2 +y 2 <r 2 

< c 6 r iog(2r/x) < c 7 r 2 , (2.12) 



2<x<r 
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for some constants Cq,Cj > 0. Plugging (12.121) into (12.111) . and summing over all 
z G A(k) such that \\z\\ < 5y/n, we get 

£ nJ(z,k,n)]<C 8 - 1 ^-^, (2.13) 

log n K 

for some constant Cs > 0. In addition, by Lemma [TBI there exist constants Cg,C\o > 
such that for every z G A(/c) such that \\z\\ > 5-y/ri, 

a / „ Ml 2 



P [tq(z) < n - v^] < — ■ exp ( - C 10 - 



gn V n 
which implies, using ( 12.101) . 



losr n V 



log n V n 
for some constant Cn > 0. Summing over all z G A(k) such that ||2;|| > 5y/n, 

V P[ J(z, k, n)] < C n ■ ^ V expf-cj^- 
^— ' log n z — ' V n 

2eA(fc):||2||>5 v /n & zeA(fc):||z||>5 v / n 



log 2 A; n 

< Cu ■ rV ■ tj, 2.14 
log rc Ar 



for some constant Ci 2 > 0. The lemma follows by (|23|l . (12~T3|) and (12~T4D . □ 

For < n G N, let d(k,n) be the vector (/(z, fc, ri))^eA(fc)n[-2n,2n] 2 - Note that 
M(k,n)= ^2 I{ z i k i n )= I(z,k,n). 

zGA(k) zGA(fc)n[-2n,2n] 2 

Lemma 18. Let k,£,n£N and let k' = (2£ + l)k + i. Then, 

H(d(k, n) | d(k', n)) < E[M(k', n)] ■ {21 + l) 2 . 

Proof. For any z' G A(k'), the square Q(z',k') is of side length 2k' + 1 = (21 + 
1) (2 A; H- 1), and so Q(z', k') can be tiled by (2£ + l) 2 disjoint squares from the collection 
{Q(z, k)} zeA{k) . 
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If Q(z, k) C Q(z',k'), then I(z,k,n) < I(z',k',n). Thus, conditioned on the vector 
d(k',n), there are at most 2 M ( fc '' n )'( 2£ + 1 ) 2 possibilities for the vector d(k,n). By clause 
of Proposition [21, and by the definition of conditional entropy, H(d(k, n) \ d(k', n)) < 



E[M(k',n) ■ (2i+ l) 2 ]. 

Lemma 19. There exists a constant C2 > such that for all n, 



□ 



H(R(n)) < C 2 



n 



log (n) 



and (Hi) of Proposition [2] 



Proof. Since the vector d(0, n) determines R(n), clauses (if 
yield that H(R(n)) < H(d(0,n)). 

Set ko — 0, and for j > 0, define inductively fcj+i = 3fcj + 1. For every j > 1, since 
3fcj < fcj4-i < 4/cj, it holds that lo ^ fcj < 9j3 _J . Let m > be the smallest j such that 
fcj > n. The entropy of d(k m ,n) is zero. By Lemmas [T71 and [T8l for < j < m — 1, 
there exist universal constants C2, C3 > such that 



H(d(kj,n) I <9(/cj + i,n)) < c 3 ■ maxjl, 



n 



log 2 n 



(j + I) 2 
gj+i 



Using clause (Hi) of Proposition [2, there exists a constant C > such that 



m— 1 



#(0(0, ra)) < ^H(8(k 

j=0 



n) 



d(k j+1 ,n)) + H(d(k m ,n))<C 



log n 



□ 



Remark 20 . The proof of Lemma [19] shows that provided one can calculate the different 
conditional probabilities (e.g., with unlimited computational power), one can sample 
the range of a random walk using only order n/ log 2 n bits. 



3 Concluding Remarks and Problems for Further 
Research 

3.1 Extracting Entropy 

Lemma [5] shows that the entropy of R(n) in two dimensions is at least C2n/ log 2 n. It is 
interesting to note that one can extract order of n/log 2 n almost uniformly distributed 
random bits, by observing a sample of the range. We sketch the construction. 
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Consider the two configurations that appear in Figure [TJ Symmetry implies that con- 
ditioned on outside of the configuration, both have the same probability of occurring. 
Thus, any occurrence of such a configuration in the range of the random walk gives an 
independent bit, e.g., setting the bit to be 1 if the right configuration occurs, and if 
the left configuration occurs. Considerations similar to those raised in the proofs above 
show that the expected number of such configurations is of order nj log 2 n. 
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Figure 1: Two symmetric configurations. X's are vertices occupied by the range. 



3.2 Intersection Equivalence 

Consider the n xn square around in Z 2 , and consider the following procedure. Divide 
the square into 4 squares of side length n/2. Retain each of the squares with probability 
1/2, independently. Continue inductively: at level k, divide each remaining square 
of side length n2~^ k ~^ into 4 squares of side length n2~ k , and retain each one with 
probability k/(k + 1) independently. 

This procedure produces a random subset of the nxn square, denote this set by Q(n 2 ). 
In [3], Peres shows that the sets Q{n 2 ) and R(n 2 ) are intersection equivalent] that is, 
there exist constants c, C > such that for any set AcZ 2 , 

P[Q(n 2 ) n^0] 
° ~ F[R(n 2 ) n A ^ 0] " ' 

The entropy H(Q(n 2 )) is of order n 2 /log 2 (n), as is H(R(n 2 )). Note that intersection 
equivalence does not imply or follow from equal entropy. See [3] for more details. 
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3.3 Open Questions 

Let G be an infinite graph, and let {S(n)} n>0 be a simple random walk on G. Let 
R{n) = {S(0), S(l), . . . , S(n)} be the range of the walk at time n. Let H{n) be the 
entropy of R{n). 

Our results above suggest the following natural questions. 

• Assume G is vertex transitive (that is, for any two vertices x, y there exists an 
automorphism of G taking x to y). Is it true that if S(-) is transient then H{n) 
grows linearly in n? It is not difficult to produce examples of non-transitive graphs, 
that are transient but have sub-linear entropy. 

• How small can H(n) be in transient graphs? It is possible to construct (spherically 
symmetric) trees that are transient but have H{n) = 0(log 2 n). Is it possible to 
get a smaller entropy? 
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